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Abstract 

Recommendation systems represent an important tool for news distribution 
on the Internet. In this work we modify a recently proposed social recom- 
mendation model in order to deal with no explicit ratings of users on news. 
The model consists of a network of users which continually adapts in or- 
der to achieve an efficient news traffic. To optimize network's topology we 
propose different stochastic algorithms that are scalable with respect to the 
network's size. Agent-based simulations reveal the features and the perfor- 
mance of these algorithms. To overcome the resultant drawbacks of each 
method we introduce two improved algorithms and show that they can opti- 
mize network's topology almost as fast and effectively as other not-scalable 
methods that make use of much more information. 

Keywords: recommender systems, social recommendation, adaptive 
networks 



1. Introduction 

In a news distribution system with a tremendous number of users, such as 
digg.com and reddit.com, thousands of news are released every hour. When 
a user logs on the system without any support, he is drowned in the flood of 
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information immediately Recommender systems emerged to help users 
deal with the information overload and find out content matching their inter- 
ests [21, S 0] • In traditional recommender systems relevant news are delivered 
to users according to their profiles and historical activities, with recommenda- 
tion mechanism designed in a centralized way: data analysis and recommen- 
dation decision are made by the system for the whole network. In contrast, 
in social recommender systems users submit and forward news to others, as 
in a peer-to-peer network. The core idea of peer-to-peer is that all nodes in 
the system have the same function: each node can both gain service from 
other nodes, as a client, and provide service to others, as a server. 

In this work we present a modified version of a previously introduced 
news recommender system |5|. This system is naturally implemented in a 
peer-to-peer environmenlQ: each user, considered as a node of the network, 
does not get news directly from some central servers, but he can post news 
and recommend them to other users (i.e. his followers) and at the same time 
he receives news from other users (i.e. his leaders). Hence each node acts as 
a follower and at the same time can become a leader in the network. When 
a user receives a news from his leader and likes it, he forwards it to his fol- 
lowers; then, if these followers like the news too, they forward it further to 
their followers, and so on. Hence news diffusion over the network resembles 
an epidemic spreading [g], 0]- As in peer-to-peer systems, the leader- follower 
network is varying in topology: links between leaders and followers may be 
changed as the network evolves. The system continually refines the neigh- 
borhood of each user until connections between taste mates are established, 
so that news can spread quickly from the original source to the users who 
are most interested in it. Network evolution is therefore driven from an in- 
terplay between topology and dynamics j^. The leader updating procedure 
is a crucial part of the whole recommender mechanism. An effective updat- 
ing method should choose for each user a leader with very similar tastes at 
very low cost. In this work we compare several heuristic methods in detail. 
We discuss their efficiency, performance and scalability (which are impor- 
tant aspects for real systems), and we analyze the properties of the evolving 
network structure with agent-based simulations. A comparison with other 



^The peer-to-peer model presented here can be developed in one of two possible ways: 
as a centralized system (with all information stored in a central server) , or in a real peer- 
to-peer system, where information about a node is actually stored in the peer node itself 
and each node both provides and gains services from other nodes. 



2 



widely adopted popularity-based recommendation methods has already been 
discussed in 

This work is organized as follows. In Section 2 we present related work. 
In Section 3 we describe the news recommendation method and an agent- 
based model to test it. In Section 4 we report simulation results for network's 
evolution with different updating methods. In Section 5 we introduce two 
refined updating methods. Lastly, we conclude with discussion. 



2. Related Works 



So far, plenty of recommendation techniques have been proposed by 
researchers. These techniques can be classified in three main categories: 
content-based, collaborative filtering and social filtering methods. 

Content-based methods [ol, try to classify items according to their 
content (e.g. by keywords) and to extract users' preferences from their pick 
history. These two elements are used to obtain recommendation: items rec- 
ommend to a user are the ones whose characteristics match user's preferences. 

Collaborative filtering methods make predictions about the interests of a 
user by collecting taste information from the community the user belongs to, 
with the underlying assumption that those users who agreed in the past tend 
to agree again in the future. In memory-based collaborative filtering, such as 



in Amazon.com the recommendation score for an item is computed as a 
weighted average of ratings given by other users with weights proportional to 
the similarity between users Hj- In model-based collaborative filtering meth- 



ods (e.g. Bayesian clustering [12| and PLSI [13]) users are classified according 



to their preferences. Diffusion-based methods, where heat conduction [14 
random walk 15| or hybrid processes 16| are employed to efficiently extract 
the taste similarity between users or items, can be considered as extensions 
of collaborative filtering. 

Social filtering methods make use of users' relationship in a social network 
to obtain recommendations. These methods are based on the assumption 
that users prefer recommendations made by a friend than those by a cen- 
tralized system 17(|. In other words, social infiuence is more determinant for 
users to decide what to read than similarity of past activities 18|, |l9| . Gen- 



erally, a social recommender system can be devised in two different ways: 
pull and push. In the first case, each user selects other users as his infor- 
mation sources and he can pull information from these sources (as in, e.g., 
Delicious.com). In the latter, each user can push what he likes to other users 



3 



who have accepted him as an information source (as in, e.g., Twitter.com). 
In many of these systems users have also the possibihty to assign information 
with user-defined words, so-called tags, that allow a more effective manage- 



ment of resources by users [20|, |2l|, |22 



Different from all methods mentioned above, the news recommendation 
mechanism we study in this work combines memory-based and social rec- 
ommendation together with network's topology optimization. Recommen- 
dation scores are computed using users' past ratings, though recommenda- 
tions come only from the information sources a user selects; in parallel, the 
leader-follower network is continually refined according to similarity of user 
tastes, in order for news to be efficiently delivered to their eagerest readers. 
This combination of social and similarity recommendation turns out to be 
particularly effective: the presence of directed links among users makes the 
recommendation process naturally fast and scalable, while the use of rating 
patterns' similarity in the information diffusion process enhance users' social 
contacts and experience. 



3. Model Description 

In this section we describe the news recommendation mechanism [sj that 
provides a personalized set of news for each user. As stated in the introduc- 
tion, the system consists of a set of users with distinct tastes who continually 
post and evaluate various news items. 

3.1. Similarity Estimation 

The taste similarity between users is predicted using statistical correlation 
of users' rating histories. We adopt Pearson correlation coefficient: for a pair 
of users i and j 

where rj and Vj^c are the ratings given by these users to news c and f j and 
fj are the average ratings of i and j. Sij G [—1, 1], with Sjj > indicating 
that user i and j tend to agree in evaluations of news while with Sij < that 
they tend to disagree. Despite the network is directed, Sij = Sj^i. 

The similarity score ([1]) is different from the one used in jsj. This is 
because we assume that users evaluate news in a binary scale: rj^c = 1 if user 
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i likes news c and otherwise r^^c = 0. In other words, we do not distinguish 
between a dishked and not rated news since in many real news distributing 
systems the only available information about users' ratings is users' click 
history. Therefore the simples possible assumption is to treat a user's click 
on a news as a positive vote for the news itself, while any information about 
users' negative interests remain undiscovered. 

The choice of Pearson correlation ([T]) as similarity metric is motivated by 
the fact that it can be used also in systems where users gives explicit ratings 
to news. However the choice of other metrics, such as Jaccard coefficient 
or Cosine similarity, does not alter the behavior and the features of the 
system. This statements also holds for asymmetric similarity metrics (as for 
example the asymmetric Jaccard coefficient), because the difference of rating 
patterns between similar users becomes smaller and smaller as they evaluate 
more news. This causes the asymmetric Sij and sj^i to converge to the same 
value, i.e. to a symmetric coefficient. 

We use some heuristics methods to avoid problems related to data spar- 
sity. If two users have no evaluations Sij is undefined. Therefore at the 
beginning of system's evolution we set Sij = Sq, with sq a small fixed value, 
for each pair of users. Sij is undefined also when a user gives all positive or 
negative evaluations. In this case we set Sjj = — 1 to prevent this kind of ma- 
licious behavior (i.e. giving ratings that do not bring any information about 
the user's interests) from affecting recommender system's performance. 

Note that for the sake of system scalability we do not store the similarity 
scores between all user pairs but only the reading history of users, from 
which the similarity scores are computed. In other words, the information is 
distributed over the network: the click history of each node, which is stored 
in the peer node itself, is available to other nodes and can be retrieved by 
them in order to perform local tasks like assigning recommendation scores to 
incoming news or updating connections to other nodes. 

3.2. News Recommendations and Network Updating 

When a user logs into the recommender system, the system checks all 
the candidate unread news forwarded by his leaders in his waiting list and 



then selects the top k [23[ of them to recommend him according to their 
recommendation scores. The recommendation score of each news is equal to 
the similarity between the user and the leader who forwarded it, so that a 
high recommendation score is assigned to the news sent by leaders with high 
similarity. When the same news is recommended to a user from multiple 
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sources, recommendation scores are not summed up, as in [s!], to prevent 
Eclipse attacks on the system {22]. Instead, each news can be recommended 
only once to each user. When the user likes a recommended news, he will 
forward it to the waiting list of his followers. Also, he can introduce a new 
news item in the system and forward it to his followers. We employ a book- 
keeping mechanism to make sure that each user does not receive news he has 
already read. Besides, users' waiting lists are of limited length, and when 
they are completely filled with candidate news the later arrived (i.e. the 
fresh news) have few chances to jump on top of the recommender list and be 
consequently read. To allow fresh contents to be more visible, the waiting 
list of a user is cleaned up after the user gets offline. Other mechanisms to 
promote the diffusion of novel news items are discussed in [sj. 

As news spread over the network and get evaluated, taste differences 
between users and their leaders can be gradually identified. We use this in- 
formation to refine the local neighborhood network and improve news prop- 
agation: from time to time we find the least similar leaders of all users and 
replace them with new ones. In what follows we will compare several updat- 
ing methods in detail. 

We remark that the present model differs from the one described in jsf 
under several aspects, as we make use of much less information, for instance: 
ratings are binary, the different recommendation scores for a news are not 
aggregated, recommendation list of users are cleaned every time they get 
offline. By keeping only the necessary information and using fewer compu- 
tational resources, we want our system to be efflcient and fast as well as 
effective and robust. 

3.3. Agent-based Modeling 

We make use of an agent-based approach to test the model. In order to 
depict the tastes of users and the attributes of news, we use D-dimensional 
vectors containing only or 1. User i's taste vector (tj) describes what 
topics he likes (1) or dislikes (0). The maximum number of distinct taste 
vectors is 2^. In |5| an alternative setting with a fixed number of Is in each 
vector was used. However the present scenario better depicts users who are 
heterogeneous in their scope of interests. 



^In an Eclipse attack a modest number of malicious nodes conspire to fool correct nodes 
into adopting the malicious nodes as their peers, with the goal of dominating the neighbor 
sets of all correct nodes. 
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At the beginning of the simulation, L = D randomly selected leaders are 
assigned to each user. The similarity between each user and his newly selected 
leaders is set to the uniform default value sq. At every running step each 
user gets online with probability po and, if active, submits a new news item 
with probability ps- The attribute vector of the submitted news is identical 
to the taste vector of the user who submitted it. When a user is online, 
he reads and evaluates the top k news recommended by his leaders. The 
overlap between user's tastes and news' attributes, computed by counting 
the number of identical bits in the corresponding vectors, is used to obtain 
user's evaluation on the news item. Agent i likes news c with attribute vector 
dc if: 

\\ti-d,\\<9 (2) 

here, || ■ || is the norm of the vector and 6 is the approval threshold. If agent i 
likes news c, then r^c = 1; if « dislikes c or if i has not evaluated c yet, r^c = 0. 
If a user likes a news, he forwards it to the waiting lists of his followers. Users' 
assessments on news are used to compute the similarity scores between users 
and their leaders. The leader updating procedure takes place for each user 
after this user gets online u times. 

3.4- Performance Metrics 

For any recommender system, the approval fraction, i.e. how often users 
are satisfied with the news they get recommended, is the most important 
performance measure. Besides, we focus mainly on the properties of the 
leader-follower network. Since in a computer simulation we have the luxury 
of knowing the taste vectors of users, we can compute the average value of 
the taste vector differences between a user and his leaders. Since there are no 
identical users in the system, the baseline value of these differences is 1 and 
we define the excess differences as the average number of differences minus 1. 
This measure shows the intrinsic quality of the leadership in the network: the 
lower the average differences, the higher the taste likeness between leaders 
and their followers. 

4. Simulation Analysis 

In our simulations we assume 13-dimensional taste vectors and hence the 
system consists of 8192 users. For other simulation parameters, see table [H 
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Table 1: Simulation parameters and their default values 



Parameter 


Notation 


Value 


Network size 


iV = 2^ 


8192 


Length of users' taste / news' attribute vector 


D 


13 


Number of leaders for each user 


L 


13 


Onhne probabihty at each step 


Po 


0.02 


Submitting probabihty when onhne 


Ps 


0.01 


Period of leader updating 


u 


10 


Length of waiting lists 


I 


100 


News read by users when online 


k 


10 


Acceptance threshold 


e 


6.5 


Initial similarity value 


So 


0.1 



4--1- Leader Updating Methods 

Updating the leader-follower network consists in finding for a specific user 
i the least similar leader (e.g. user j) and replace this leader with a new one 
(e.g. user k). Some straightforward methods to select the new leader are: 

• Pure Random: the new leader is simply a randomly selected user; 

• Selective Random: the new leader is selected as in the Pure Random 
method, but he replaces the old leader only if he has a higher similarity 
score with user z, i.e. if Sik > Sij] 

• Best's Random Leader (BRL): the new leader A; is a user chosen at 
random among the leaders of user i's best leader; 

• Global Memory Based (GMB^. the new leader k is the user in the 
network with the highest similarity with user i. 

Figured] shows the performance of these methods. As expected, the GMB 
strategy performs best (by achieving the highest approval fraction and the 
best assignment of leaders — see the insets of figure H]) because it makes use 
of all available information. However, this method is very expensive in terms 
of computing cycles: it involves the computation of — 1 similarity scores to 



■^GMB updating method is similar to a top L [2^ Collaborative Filtering algorithm 
(although in this case there is no spreading of news on a social network). 



8 



0.8 



I 



> 
o 



0.6 



/ 



0.4 



0.21, 



Random 
Selective 
BRL 
GMB 



5x10- 
simulation length 



5x10 
simulation length 



1x10 



O 



o 

X 



1 

I 

\ 

4-\ 



Random 
Selective 
BRL 
GMB 




5x10- 

simulation length 



5x10 
simulation length 



1x10 



Figure 1: Approval fraction and excess differences in the network for different leader up- 
dating methods. The horizontal dotted line represents the optimal assignment of leaders. 
Insets: comparison of these methods with the GMB for a smaller system {D — 10). 
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update the leaders of a single user, and 0{N'^) to update the whole network 
once. Therefore it is not scalable with respect to the system's size. We use it 
only as a benchmark for the other methods. The pure and selective random 
methods perform worse, but have a much smaller computational cost — 0{N) 
to update the whole network — and can still converge to a network state with 
a good assignment of leaders. As compared to pure random, the selective 
method may be slower in network exploration but is more careful in selec- 
tions, therefore it eventually reaches a network state as good as with the 
GMB method. On the other hand, the BRL method, while exploiting the 
local neighborhood network with computational cost 0{N * D) (it involves 
the computation of D similarity values to identify the best leader of a user), 
reaches a strongly sub-optimal network state and has the worst performance 
in approval fraction. This is because with this procedure a user can only 
exchange his leader for a two-step away user, hence he gets a very limited 
range of possibilities for selection and may not ever get in contact with his 
taste mates if they are too distant from him. 

Lower panel of figure [T] also shows that system's evolution — driven by 
any leader updating method — eventually stops when an equilibrium state is 
found. In such state, users have no advantage in changing their choices (i.e 
leaders) simply because they cannot find better ones. Thus an equilibrium 
state corresponds to an optimum for each agent. However, since users make 
decisions independently from what others have chosen, the particular equi- 
librium state reached by the system does depend strongly on the updating 
method employed (i.e. how users make their decisions) and only weakly on 
the interaction between agents (as user z's selection of leaders little influ- 
ences news arriving to user j). Hence, if the updating method allows users 
to make the best choices for themselves, the equilibrium state will be not 
only an optimum for each agent but also a global minimum, i.e. a ground 
state for the whole system. In the ground state each user is linked to his best 
taste mates: there is not a better possible configuration in terms of leader 
assignment to users. The excess differences in the ground state (represented 
in figure [T] by a horizontal dotted line) are determined by equation in 
our artificial environment: as shown in figure HI for each user first-order and 
second-order taste mates (i.e. users who differ from him in one and two 
elements of taste vectors, respectively) are indistinguishable in terms of the 
similarity ([1]). Since each user has D first-order and D{D — l)/2 second-order 
taste mates in the network and he has to chose D of them as leaders, the 
maximization of similarity is equally likely to select a first or second order 
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taste mate and hence the expected value of excess differences is 
1 



D D(D -l)/2 D-1 



D{D + l)/2 D{D + l)/2 



D + 1 



(3) 
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Figure 2: Average similarity score ([T|) versus vector differences averaged over all pairs of 
users. Similarities were computed over 1000 randomly selected news evaluated in common 
by all users. 
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Figure 3: Computational time (in arbitrary units) required to reach excess differences 
equal to 3 for different values of the updating interval u. Simulations with the selective 
random updating method. 
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Finally we remark that the frequency of the leader updating strongly in- 
fluences the convergence rate of the system. If the updating interval is too 
short, there are not enough evaluations to assess the real similarity between 
a leader and his followers, hence the leadership structure of the network de- 
teriorates. On the other hand, if the updating interval is too long there is 
redundant information for similarity evaluations and leader updating, there- 
fore the evolution of the system is unnecessarily slowed down. Figure [3] shows 
that it is possible to find an optimal value of the leader updating frequency 
which yields the best convergence rate. 

4-2. Network's Properties 

We now compare the leader updating methods from the viewpoint of 
network topology. We use two quantities to describe the network: 

• Link reciprocity, the tendency of node pairs to form connections be- 



tween each other [25|. This quantity is defined as the ratio between the 



number of bidirected links and the total number of links in a network: 

p = (4) 

where ajj = 1 if user j is one of user i's leaders (ajj = otherwise). 
Clustering coefficient, the tendency of a network to form tightly con- 



nected neighborhoods |26|. This quantity is calculated as the ratio 
between the number of triangles a user forms with his neighbors and 
the total number of possible triangles this user can form, averaged over 
all users. When edges are directed, each user can generate up to eight 
different triangles with any pair of neighbors, hence: 

(5) 



N 2[df\df'-l)-2d* 

where = + denotes the number of i's neighbors (i's total 

degree) and (i^ = Ylij '^ij^-ji is the number of bidirected links i forms 
with his neighbors. 

Figure H] shows the evolution of the link reciprocity in networks with 
different leader updating methods. In the initial random networks the av- 
erage probability of finding a reciprocal link between two connected vertices 
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Figure 4: Link reciprocity and clustering coefficient of the network for different leader 
updating methods (a is represented by > and pgs, Cgs by * in the respective panels). 



is simply equal to the average probability of finding a link between any two 
vertices, which is given by a = Ylijkj '^ij/Wi^ ~ 1)] = D/{N — 1). Hence the 
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starting value of p is equal to a. As the network evolves, the reciprocity coeffi- 
cient grows from a with all updating methods: network's structure becomes 
more and more reciprocal. This growth eventually stops when a station- 
ary state of the network is found. If the network is driven to the ground 
state (e.g. by the selective method) the value of the reciprocity becomes 
Pgs = L/[D{D + l)/2] = 2/{D + 1), because in this state each user choses his 
L leaders among D{D + l)/2 best taste-mates who are indistinguishable in 
term of the similarity score (see figure [2]): each user choose randomly among 
them and p ^ Pgs- 

The evolution of the clustering coefficient is also shown in figure HI With 
any updating method c grows from its initial value, again equal to a (the 
probability to find a link between two vertices in the initial random net- 
work). The selective method, driving the system to the optimal state, makes 
c converge to Cgs = PgsP{\\tj - 4|| < 2 | aij = aik = 1) = ^/[{D + 1){D + 2)] 
(i.e. the probability to find a link between two users j and k who are linked 
to user i and who are taste-mates of user i). Cgs, like pgs, corresponds to 
users randomly choosing leaders among their best taste-mates. Instead with 
the BRL method, where links can only form locally, the clustering coefficient 
becomes immediately very large (even bigger than Cgs): network's evolution 
is hindered because a high clustering coefficient means that news do not 
spread effectively (they get few directions to propagate and their traffic is 
limited to a subset of the network). Users who happen to be far apart are 
unlikely to read the same news: their similarity remains undiscovered and 
they cannot be connected even if they are taste-mates. Note also that with 
this method once a node has no followers, it will never get any follower in 
future because it cannot be selected as a leader, hence some users may get 
isolated from the system. These two factors cause the network evolution to 
end in a sub-optimal equilibrium state. 

5. Refined Algorithms 

In this section we propose new updating methods that strongly enhance 
the system's performance. 

5.1. Hybrid algorithm 

The BRL method has the ability to exploit the local neighborhood net- 
work, that is the most significant feature of a peer-to-peer system. Unfor- 
tunately, as we have mentioned, it has some major drawbacks caused by 
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Figure 5: Performance of the Hybrid algorithm {p — 0.75) compared to other leader 
updating methods. Inset: comparison with GMB for a smaller system {D — 10). 



the limited range of the network that this method is able to explore. We 
attempt to overcome these problems by broadening the exploration range. 
The first possible solution is to explore the whole second-order neighborhood 
(i.e., not only the subset related to the best leader), or even to go further by 
including the third-order neighborhood, fourth-order neighborhood, and so 
on. However, the computational cost of such exploration grows very fast — it 
is 0{N * D^) for a A;*''-order neighborhood — and soon becomes comparable 
with the cost of the GMB method. 

A more effective solution is to employ some randomness in the updat- 
ing method in order to have the possibility to connect users regardless of 
their distance and prevent the system from getting trapped in a sub-optimal 
state. Hence we propose a hybrid updating method that combines selective 
and BRL methods: at each updating step we employ the first with probabil- 
ity p, otherwise we employ the second. This mechanism mimics the evolution 
of self-organizing communities where users search for friends among friends 
of friends but casual encounters also occur. Figure |5] shows that the hy- 
brid method improves system's performance as compared to both selective 
and BRL methods, still keeping an affordable 0{N * D) computational cost. 
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Hence it represents a cheap and effective updating algorithm suitable for 
large systems. 

5.2. Las Vegas algorithm 

We now concentrate on the drawbacks of the above mentioned random 
updating methods. The pure random method may replace an old leader 
with a less similar one, hence it cannot reach the best assignment of leaders. 
Therefore the selective method outperforms the pure random. However the 
selective method may leave the network unchanged if it doesn't find a better 
leader (which may occur quite often), hence slowing down the system's evo- 
lution. A simple solution to this problem is to employ a selective approach 
with more trials. We name this new method Las Vegas algorithm |27|. Its 
basic idea is that if the newly randomly selected candidate leader is less sim- 
ilar than the old leader, we drop it and retry the random updating until a 
better candidate is found. Since there may not be a better candidate, we 
set the maximum number of trials to a predefined value v. The bigger this 
value, the longer the execution time, but also the higher chances to find a 
better leader. 
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Figure 6: Performance of the Las Vegas algorithm for different values of v compared to 
other updating methods. Inset: comparison with GMB for a smaller system (D = 10). 
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Figure 7: Average number of actual trials (v) of the Las Vegas algorithm with w = 50 at 
different stages of the system's evolution (represented by the value of the excess differ- 
ences). At the beginning, (v) ^ v (the fictitious starting point at {v) = only represent 
the initial network's state). As the system approaches the ground state, (v) v {V 
represents the point of convergence). 



Figure [6] shows the performance of the Las Vegas algorithm for differ- 
ent values of v. Note that the selective method is nothing but a Las Vegas 
method with v = 1. As v increases from 1, the convergence rate of the 
Las Vegas method becomes faster and faster, and it becomes even compa- 
rable with the GMB method, yet keeping the affordable computational cost 
0{N * v) <^ 0{N'^) of GMB. Moreover, the average number of actual trials 
(f ) of the Las Vegas algorithm (figure [7]) is very small at the beginning of 
system's evolution, when it is very easy to find better candidate leaders for 
users: {v) <^ v again reduces computational cost. Then the network ap- 
proaches to the ground state and {v) — )■ w as each user is already linked to 
his taste mates; however at this stage the evolution already ended. Las Vegas 
algorithm therefore improves both effectiveness and efficiency of the leader 
updating schedule, standing as a candidate mechanism to be employed in 
real recommender systems. 

6. Conclusion 

Online news websites are nowadays overloaded with a variety of fresh 
information. The hard task for readers is how to find the news they are 
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really interested in. Recommender systems are a possible answer to this 
problem. A significant yet less noticed aspect is that social relationship is 
extremely useful in delivering news to potentially interested users. 

In this work we generalized a recently proposed news recommendation 
method Q to make it more efficient and effective. This method combines 
memory-based and social recommendation: recommendation scores of news 
items are computed using users' past ratings, though recommendations come 
only from the information sources a user selects. Another important feature 
of this model is that network's topology can be continually refined according 
to similarity of user tastes, in order for news to be efficiently delivered to the 
users that are most interested in them. We proposed new stochastic algo- 
rithms which are computationally cheap (and hence scalable with system's 
size) and provide an effective way to speed up the leader updating process. 
The next step will be to develop the described system on a real platform. 

There are still some open questions to be answered. For instance, users 
do not differ only in their particular tastes but in many other aspects, like 
activity, reputation, generosity, and so on. News are also different in many 
aspects, like quality and freshness. Taking into account users' and news' het- 
erogeneity may bring to a more realistic scenario. Another important point 
is that any real recommender system must face potential attacks from ma- 
licious users or communities, that may systematically introduce spam news 
or try to intentionally mislead the system. Measures of trustworthiness and 
reputation of users can be introduced to cope with these threats. 
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